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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Source: 



Date Processed by STIC: 





01 


ll 


££>o| 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21hclp@,uspto,gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3,0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEEBELOW: 



Checker Version 3.0 
The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1 821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1; 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST 25 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTOV 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker- 



Raw Sequence Listing Error Summary 



SUGGESTED CORRECTION 



SERIAL NUMBER: 



: ©1 



ERROR DETECTED 

ATTN: NEW RULES CASES: PLEASE DISRECARD ENCLISII "ALPHA* HEADERS, WHICH WERE INSERTED BY PTO SOD 



I Wrapped Nucleic* 

Wrapped Aminos 



The numbcrAext at Che end of each line "wrapped** down lo the next line. Jhii may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping,'* 



Invalid Line Length The rules require thai a line not exceed 72 characters in length, this includes white spaces. 



10 



II 



_ Misaligned Amino 
Numbering 

Non-ASCII 



_ Variable Length. 



Palentln 2.0 

-bug" 



_Skipped Sequences 
(OLD RULES) 



_Skipped Sequences* 
(NEW RULES) 



_Usc of n*s or Xaa's 
(NEW RULES) 



Jnvalid <213> 
Response 

Use of<220> 



12 Palenltn 2.0 

"bug" 



The numbering under each 5* amino acid is misaligned. Do not use Ub codes between numbers; 
use Bp ace characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved In ASCII text. 

Sequences) contain n's or Xaa's representing more than, one residue. Per Sequence Rules, 

each it or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused fiTe <220>*<223>»ection to be missing from amino acid 

sequencers) . Normally, Patentln would 1 automatically, geri era ite Ihis section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies lo the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequences) missing. If intentional, please insert the following lines for eaeh skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do opt insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTIONS EQ ID NO:X: (insert SEQ ID.NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response lo Include the skipped sequences. 



Sequencers) _ 



missing. Iflntcntlonal, please insert Ibe following lines for each skipped sequence. 



<2 10> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATOR V if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 orSequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific 'name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequences) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if<2 13> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Regtstcr," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do nol use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 ■ Misuse of n v n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value nol specifically a nucleotide. 
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OIPE 



RAW SEQUENCE LISTING DATE : 09/18/2001 

PATENT APPLICATION: US/09/941,947 TIME: 10:40:16 

Input Set : A:\CL1903 US NA Seq Listing.txt 
Output Set: N:\CRF3\09182001\I941947.raw 

5 <110> APPLICANT: Brzostowicz, Patricia C. 

6 Cheng, Qiong 

7 DiCosimo, Deana J. ^ 

8 Koffas, Mattheos D06S NOt Comply 

9 Miner, Edwards, jr. Gbffected Diskette Needed 

10 Odom, J. Martin n n - 

11 Picataggio, Steve \ °**f- *" / 

12 Rouviere, Pierre E. 

16 <120> TITLE OF INVENTION: CAROTENOID PRODUCTION FROM A SINGLE CARBON SOURCE 

20 <130> FILE REFERENCE: CL1903 US NA 
C--> 23 <140> CURRENT APPLICATION NUMBER: US/09/941,947 
C--> 23 <141> CURRENT FILING DATE: 2001-08-29 

23 <150> PRIOR APPLICATION NUMBER: 60/229,907 

24 <151> PRIOR FILING DATE: 2000-09-01 

26 <150> PRIOR APPLICATION NUMBER: 60/229,858 

27 <151> PRIOR FILING DATE: 2000-09-01 
29 <16 0> NUMBER OF SEQ ID NOS : 6 0 

33 <170> SOFTWARE: Microsoft Office 97 
37 <210> SEQ ID NO: 1 
39 <211> LENGTH: 1311 
41 <212> TYPE: DNA 

4 3 <213> ORGANISM: Methylomonas 16a 

47 <400> SEQUENCE: 1 

48 gatgtggtca catggcccta tcacttaacg gctgatattc gattttgtca ttggtttttt 60 
50 cttaacttta acttctacac gctcatgaac aaacctaaaa aagttgcaat actgacagca 120 
52 ggcggcttgg cgccttgttt gaattccgca atcggtagtt tgatcgaacg ttataccgaa 180 
54 atcgatccta gcatagaaat catttgctat cgcggcggtt ataaaggcct gttgctgggc 240 
56 gattcttatc cagtaacggc cgaagtgcgt aaaaaggcgg gtgttctgca acgttttggc 300 
58 ggttctgtga tcggcaacag ccgcgtcaaa ttgaccaatg tcaaagactg cgtgaaacgc 360 
60 ggtttggtca aagagggtga agatccgcaa aaagtcgcgg ctgatcaatt ggttaaggat 420 
62 ggtgtcgata ttctgcacac catcggcggc gatgatacca atacggcagc agcggatttg 480 
64 gcagcattcc tggccagaaa taattacgga ctgaccgtca ttggtttacc taaaaccgtc 540 
66 gataacgacg tatttccgat caagcaatca ctaggtgctt ggactgccgc cgagcaaggc 600 
68 gcgcgttatt tcatgaacgt ggtggccgaa aacaacgcca acccacgcat gctgatcgta 660 
70 cacgaagtga tgggccgtaa ctgcggctgg ctgaccgctg caaccgcgca ggaatatcgc 720 
72 aaattactgg accgtgccga gtggttgccg gaattgggtt tgactcgtga atcttatgaa 780 
74 gtgcacgcgg tattcgttcc ggaaatggcg atcgacctgg aagccgaagc caagcgcctg 840 
76 cgcgaagtga tggacaaagt cgattgcgtc aacatcttcg tttccgaagg tgccggcgtc 900 
78 gaagctatcg tcgcggaaat gcaggccaaa ggccaggaag tgccgcgcga tgcgttcggc 960 
80 cacatcaaac tggatgcggt caaccctggt aaatggttcg gcgagcaatt cgcgcagatg 1020 
82 ataggcgcgg aaaaaaccct ggtacaaaaa tcgggatact tcgcccgtgc ttctgcttcc 1080 
84 aacgttgacg acatgcgttt gatcaaatcg tgcgccgact tggcggtcga gtgcgcgttc 1140 
86 cgccgcgagt ctggcgtgat cggtcacgac gaagacaacg gcaacgtgtt gcgtgcgatc 1200 
88 gagtttccgc gcatcaaggg cggcaaaccg ttcaatatcg acaccgactg gttcaatagc 1260 
90 atgttgagcg aaatcggcca gcctaaaggc ggtaaagtcg aagtcagcca c 13" 1 " 
93 <210> SEQ ID NO: 2 

95 <211> LENGTH: 437 
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RAW SEQUENCE LISTING DATE: 09/18/2001 

PATENT APPLICATION: US/09/941 f 947 TIME: 10:40:16 

Input Set : A:\CL1903 US NA Seq Listing.txt 
Output Set: N:\CRF3\09182001\I941947.raw 

97 <212> TYPE: PRT 

99 <213> ORGANISM: Methylomonas 16a 



103 


<400> SEQUENCE: 


2 
























105 


Asp 


Val 


Val 


Thr 


Trp 


Pro 


Tyr 


His 


Leu 


Thr 


Ala 


Asp 


He 


Arg 


Phe 


Cys 


106 


1 








5 










10 










15 




109 


His 


Trp 


Phe 


Phe 


Leu 


Asn 


Phe 


Asn 


Phe 


Tyr 


Thr 


Leu 


Met 


Asn 


Lys 


Pro 


110 








20 










25 










30 






113 


Lys 


Lys 


Val 


Ala 


He 


Leu 


Thr 


Ala 


Gly 


Gly 


Leu 


Ala 


Pro 


Cys 


Leu 


Asn 


114 






35 










40 










45 








117 


Ser 


Ala 


He 


Gly 


Ser 


Leu 


He 


Glu 


Arg 


Tyr 


Thr 


Glu 


He 


Asp 


Pro 


Ser 


118 




50 










55 










60 










121 


He 


Glu 


He 


He 


Cys 


Tyr 


Arg 


Gly 


Gly 


Tyr 


Lys 


Gly 


Leu 


Leu 


Leu 


Gly 


122 


65 










70 










75 










80 


125 


Asp 


Ser 


Tyr 


Pro 


Val 


Thr 


Ala 


Glu 


Val 


Arg 


Lys 


Lys 


Ala 


Gly 


Val 


Leu 


126 










85 










90 










95 




129 


Gin 


Arg 


Phe 


Gly 


Gly 


Ser 


Val 


He 


Gly 


Asn 


Ser 


Arg 


Val 


Lys 


Leu 


Thr 


130 








100 










105 










110 






133 


Asn 


Val 


Lys 


Asp 


Cys 


Val 


Lys 


Arg 


Gly 


Leu 


Val 


Lys 


Glu 


Gly 


Glu 


Asp 


134 






115 










120 










125 








137 


Pro 


Gin 


Lys 


Val 


Ala 


Ala 


Asp 


Gin 


Leu 


Val 


Lys 


Asp 


Gly 


Val 


Asp 


He 


138 




130 










135 










140 










141 


Leu 


His 


Thr 


He 


Gly 


Gly Asp 


Asp 


Thr 


Asn 


Thr 


Ala 


Ala 


Ala 


Asp 


Leu 


142 


145 










150 










155 










160 


145 


Ala 


Ala 


Phe 


Leu 


Ala 


Arg 


Asn 


Asn 


Tyr 


Gly 


Leu 


Thr 


Val 


lie 


Gly 


Leu 


146 










165 










170 










175 




149 


Pro 


Lys 


Thr 


Val 


Asp 


Asn 


Asp 


Val 


Phe 


Pro 


He 


Lys 


Gin 


Ser 


Leu 


Gly 


150 








180 










185 










190 






153 


Ala 


Trp 


Thr 


Ala 


Ala 


Glu 


Gin 


Gly 


Ala 


Arg 


Tyr 


Phe 


Met 


Asn 


Val 


Val 


154 






195 










200 










205 








157 


Ala 


Glu 


Asn 


Asn 


Ala 


Asn 


Pro 


Arg 


Met 


Leu 


lie 


Val 


His 


Glu 


Val 


Met 


158 




210 










215 










220 










161 


Gly 


Arg 


Asn 


Cys 


Gly 


Trp 


Leu 


Thr 


Ala 


Ala 


Thr 


Ala 


Gin 


Glu 


Tyr 


Arg 


162 


225 










230 










235 










240 


165 


Lys 


Leu 


Leu 


Asp 

XT 


Arg 


Ala 


Glu 


Trp 


Leu 


Pro 


Glu 


Leu 


Gly 


Leu 


Thr 


Arg 


166 










245 










250 










255 




169 


Glu 


Ser 


Tyr 


Glu 


Val 


His 


Ala 


Val 


Phe 


Val 


Pro 


Glu 


Met 


Ala 


lie 


Asp 


170 








260 










265 










270 






173 


Leu 


Glu 


Ala 


Glu 


Ala 


Lys 


Arg 


Leu 


Arg 


Glu 


Val 


Met 


Asp 


Lys 


Val 


Asp 


174 






275 










280 










285 








177 


Cys 


Val 


Asn 


He 


Phe 


Val 


Ser 


Glu 


Gly 


Ala 


Gly 


Val 


Glu 


Ala 


He 


Val 


178 




290 










295 










300 










181 


Ala 


Glu 


Met 


Gin 


Ala 


Lys 


Gly 


Gin 


Glu 


Val 


Pro 


Arg 


Asp 


Ala 


Phe 


Gly 


182 


305 










310 










315 










320 


185 


His 


He 


Lys 


Leu 


Asp 


Ala 


Val 


Asn 


Pro 


Gly 


Lys 


Trp 


Phe 


Gly 


Glu 


Gin 


186 










325 










330 










335 




189 


Phe 


Ala 


Gin 


Met 


He 


Gly 


Ala 


Glu 


Lys 


Thr 


Leu 


val 


Gin 


Lys 


Ser 


Gly 


190 








340 










345 










350 






193 


Tyr 


Phe 


Ala 


Arg 


Ala 


Ser 


Ala 


Ser 


Asn 


Val 


Asp 


Asp 


Met 


Arg 


Leu 


lie 


194 






355 










360 










365 
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RAW SEQUENCE LISTING DATE: 09/18/2001 

PATENT APPLICATION: US/09/941,947 TIME: 10:40:16 

Input Set : A:\CL1903 US NA Seq Listing.txt 
Output Set: N:\CRF3\09182001\I941947.raw 

197 Lys Ser Cys Ala Asp Leu Ala Val Glu Cys Ala Phe Arg Arg Glu Ser 

198 370 375 380 

201 Gly Val lie Gly His Asp Glu Asp Asn Gly Asn Val Leu Arg Ala lie 

202 385 390 395 400 
20 5 Glu Phe Pro Arg lie Lys Gly Gly Lys Pro Phe Asn He Asp Thr Asp 
206 405 410 415 

209 Trp Phe Asn Ser Met Leu Ser Glu He Gly Gin Pro Lys Gly Gly Lys 

210 420 425 430 

213 Val Glu Val Ser His 

214 435 

217 <210> SEQ ID NO: 3 
219 <211> LENGTH: 636 
221 <212> TYPE: DNA 

223 <213> ORGANISM: Methylomonas 16a 

227 <400> SEQUENCE: 3 

228 gaaaatacta tgtccgtcac 
230 atggtcatca atcatctgga 
232 ttgaaagttt tggagatcac 
234 aaagccgaag taccggacgc 
236 tatcaagcga ttgacgccgg 
238 ctcaacgaag cgctagcatc 
24 0 gtcatgcgtt tattggaaaa 
242 ggcggcatac cgatgctgaa 
244 accggcggcg tcaatcccaa 
246 gtcggcggct cctggatggc 
24 8 acgcggcggg cgagcgaggc 
251 <210> SEQ ID NO: 4 
253 <211> LENGTH: 212 



255 


<212> TYPE: 


PRT 


























257 


<213> ORGANISM: 


Methylomonas 16a 
















261 


<4 00> SEQUENCE: 


4 
























263 


Glu 


Asn 


Thr 


Met 


Ser 


Val 


Thr 


He 


. Lys 


Glu 


Val 


Met 


Thr 


Thr 


Ser 


Pro 


264 


1 








5 










10 










15 




267 


Val 


Met 


Pro 


Val 


Met 


Val 


He 


Asn 


His 


Leu 


Glu 


His 


Ala 


Val 


Pro 


Leu 


268 








20 










25 










30 






271 


Ala 


Arg 


Ala 


Leu 


Val 


Asp 


Gly 


Gly 


Leu 


Lys 


Val 


Leu 


Glu 


He 


Thr 


Leu 


272 






35 










40 










45 








275 


Arg 


Thr 


Pro 


Val 


Ala 


Leu 


Glu 


Cys 


He 


Arg 


Arg 


He 


Lys 


Ala 


Glu 


Val 


276 




50 










55 










60 










279 


Pro 


Asp 


Ala 


He 


Val 


Gly 


Ala 


Gly 


Thr 


He 


He 


Asn 


Pro 


His 


Thr 


Leu 


280 


65 










70 










75 










80 


283 


Tyr 


Gin 


Ala 


He 


Asp 


Ala 


Gly 


Ala 


Glu 


Phe 


He 


Val 


Ser 


Pro 


Gly 


He 


284 










85 










90 










95 




287 


Thr 


Glu 


Asn 


Leu 


Leu 


Asn 


Glu 


Ala 


Leu 


Ala 


Ser 


Gly 


Val 


Pro 


He 


Leu 


288 








100 










105 










110 






291 


Pro 


Gly 


Val 


He 


Thr 


Pro 


Ser 


Glu 


Val 


Met 


Arg 


Leu 


Leu 


Glu 


Lys 


Gly 


292 






115 










120 










125 








295 


He 


Asn 


Ala 


Met 


Lys 


Phe 


Phe 


Pro 


Ala 


Glu 


Ala 


Ala 


Gly 


Gly 


lie 


Pro 


296 




130 










135 










140 
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catcaaagaa gtcatgacca cctcgcccgt tatgccggtc 60 

acatgccgtc cctctggctc gcgcgctagt cgacggtggc 120 

attgcgcacg ccggtggcac tggaatgtat ccgacgtatc 180 

catcgtcggc gcgggcacca tcatcaaccc tcataccttg 240 

tgcggaattc atcgtcagcc ccggcatcac cgaaaatcta 300 

cggcgtgcct atcctgcccg gcgtcatcac acccagcgag 360 

aggcatcaat gcgatgaaat tctttccggc tgaagccgcc 420 

atcccttggc ggccccttgc cgcaagtcac cttctgtccg 480 

aaacgcgccc gaatatctgg cattgaaaaa tgtcgcctgc 540 

gccggccgat ctggtagatg ccgaagactg ggcggaaatc 600 

cgcggcattg aaaaaa 636 



RAW SEQUENCE LISTING DATE: 09/18/2001 

PATENT APPLICATION: US/09/941 , 947 TIME: 10:40:16 



Input Set : A:\CL1903 US NA Seq Listing.txt 
Output Set: N:\CRF3\09182001\I941947.raw 



299 Met Leu Lys Ser Leu Gly Gly Pro Leu Pro Gin 

300 145 150 155 

303 Thr Gly Gly Val Asn Pro Lys Asn Ala Pro Glu 

304 165 170 

307 Asn Val Ala Cys Val Gly Gly Ser Trp Met Ala 

308 180 185 

311 Asp Ala Glu Asp Trp Ala Glu lie Thr Arg Arg 

312 195 200 

315 Ala Leu Lys Lys 

316 210 

319 <210> SEQ ID NO: 5 

321 <211> LENGTH: 1860 

323 <212> TYPE: DNA 

325 <213> ORGANISM: Methylomonas 16a 

329 <400> SEQUENCE: 5 

330 atgaaactga ccaccgacta tcccttgctt aaaaacatcc 
332 gcgctgtcca aggaccagct ccagcaactg gctgacgagg 
334 acggtcagca tttccggcgg ccattttgcg gccggcctcg 
336 gccttgcatt atgtgttcaa tacccccgtc gatcagttgg 
338 gcctatccgc acaagattct gaccggtcgc aaggagcgca 
34 0 ggcggggtgt cagcctttcc ggcgcgggac gagagcgaat 
342 cattccagca cctcgatcag cgcggcactg ggcatggcca 
344 gaagacaaga agatggtagc catcatcggc gacggttcca 
34 6 gaggcgatga atcatgccgg cgatgtgaat gccaacctgc 
348 gatatgtcga tctcgccgcc ggtcggggcg atgaacaatt 
350 agcaagtttt attcgtcggt gcgggaagag agcaagaaag 
352 gtgtgggaac tggcgcgcaa gaccgaggaa cacgtgaagg 
354 ttgttcgagg aattgggctt caattatttc ggcccgatcg 
356 ctggtgtcga ccctggaaaa tctgaaggat ttgaccgggc 
358 accaagaagg gcaaaggcta tgcgccagcc gagaaagacc 
360 ccggctttcg atccgaccaa ggatttcctg cccaaggcgg 
362 tataccgagg tgttcggccg ctggctgtgc gacatggcgg 
364 ggcatcacgc cggcgatgcg cgaaggctct ggtttggtgg 
366 aatcgctatt tcgatgtcgc catcgccgag cagcatgcgg 
368 gcctgccagg gcgccaagcc ggtggtggcg atttattcca 
370 gatcagttga tccacgacgt ggccttgcag aacttagata 
372 gccggcttgg tcggcccgga tggaccgacc catgctggcg 
374 cgctgtattc cgaacatgct gatcatggct ccagccgacg 
376 ctgaccaccg gcttccaaca ccatggcccg gcttcggtgc 
378 cccggggcgg caatcgatcc gaccctgacc gcgctggaga 
380 caccacggca gccgcatcgc cattctggcc tggggcagca 
382 gccggcaagc agctgggcgc gacggtggtg aacatgcgtt 
384 gccttggtgc tggaattggc caggacgcac gatgtgttcg 
386 atcgccggcg gcgctggcag tgcgatcaac accttcctgc 
388 ccggtctgca acatcggcct gcccgaccgc ttcgtcgagc 
390 ctcagcctgg tcggcctcga cagcaagggc atcctcgcca 
393 <210> SEQ ID NO: 6 

395 <211> LENGTH: 620 

397 <212> TYPE: PRT 



Val Thr Phe Cys Pro 
160 

Tyr Leu Ala Leu Lys 
175 

Pro Ala Asp Leu Val 
190 

Ala Ser Glu Ala Ala 
205 



acacgccggc 


ggacatacgc 


60 


tgcgcggcta 


tctgacccac 


120 


gcaccgtgga 


actgaccgtg 


180 


tctgggacgt 


gggccatcag 


240 


tgccgaccat 


tcgcaccctg 


300 


acgatgcctt 


cggcgtcggc 


360 


ttgcgtcgca 


gctgcgcggc 


420 


tcaccggcgg 


catggcctat 


480 


tggtgatctt 


gaacgacaac 


540 


atctgaccaa 


ggtgttgtcg 


600 


ctctggccaa 


gatgccgtcg 


660 


gcatgatcgt 


gcccggtacc 


720 


acggccatga 


tgtcgagatg 


780 


cggtattcct 


gcatgtggtg 


840 


cgttggccta 


ccatggcgtg 


900 


cgccgtcgcc 


gcatccgacc 


960 


ctcaagacga 


gcgcttgctg 


1020 


aattctcaca 


gaaatttccg 


1080 


tgaccttggc 


cgccggccag 


1140 


ccttcctgca 


acgcggttac 


1200 


tgctctttgc 


actggatcgt 


1260 


cctttgatta 


cagctacatg 


1320 


agaacgagtg 


caggcagatg 


1380 


gctatccgcg 


cggcaaaggg 


1440 


tcggcaaggc 


cgaagtcaga 


1500 


tggtcacgcc 


tgccgtcgaa 


1560 


tcgtcaagcc 


gttcgatcaa 


1620 


tcaccgtcga 


ggaaaacgtc 


1680 


aggcgcagaa 


ggtgctgatg 


1740 


aaggtagtcg 


cgaggaattg 


1800 


ccatcgaaca 


gttttgcgct 


1860 
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399 


<213> ORGANISM: 


Me thy 1 omon as 16a 
















403 


<4 00> SEQUENCE: 


6 
























405 


Met 


Lys 


Leu 


Thr 


Thr 


Asp 


Tyr 


Pro 


Leu 


Leu 


Lys 


Asn 


He 


His 


Thr 


Pro 


406 


1 








5 










10 










15 




409 


Ala 


Asp 


lie 


Arg 


Ala 


Leu 


Ser 


Lys 


Asp 


Gin 


Leu 


Gin 


Gin 


Leu 


Ala 


Asp 


410 






20 










25 










30 






413 


Glu 


Val 


Arg 


Gly 


Tyr 


Leu 


Thr 


His 


Thr 


Val 


Ser 


He 


Ser 


Gly 


Gly 


His 


414 






35 










40 










45 








417 


Phe 


Ala 


Ala 


Gly 


Leu 


Gly 


Thr 


Val 


Glu 


Leu 


Thr 


Val 


Ala 


Leu 


His 


Tyr 


418 




50 










55 










60 










421 


Val 


Phe 


Asn 


Thr 


Pro 


Val 


Asp 


Gin 


Leu 


Val 


Trp 


Asp 


Val 


Gly 


His 


Gin 


422 


65 










70 










75 










80 


425 


Ala 


Tyr 


Pro 


His 


Lys 


He 


Leu 


Thr 


Gly 


Arg 


Lys 


Glu 


Arg 


Met 


Pro 


Thr 


426 










85 










90 










95 




429 


lie 


Arg 


Thr 


Leu 


Gly 


Gly 


Val 


Ser 


Ala 


Phe 


Pro 


Ala 


Arg 


Asp 


Glu 


Ser 


430 








100 










105 










110 






433 


Glu 


Tyr 


Asp 


Ala 


Phe 


Gly 


Val 


Gly 


His 


Ser 


Ser 


Thr 


Ser 


He 


Ser 


Ala 


434 






115 










120 










125 








437 


Ala 


Leu 


Gly 


Met 


Ala 


He 


Ala 


Ser 


Gin 


Leu 


Arg 


Gly 


Glu 


Asp 


Lys 


Lys 


438 




130 










135 










140 










441 


Met 


Val 


Ala 


He 


He 


Gly 


Asp 


Gly 


Ser 


He 


Thr 


Gly 


Gly 


Met 


Ala 


Tyr 


442 


145 










150 










155 










160 


445 


Glu 


Ala 


Met 


Asn 


His 


Ala 


Gly 


Asp 


Val 


Asn 


Ala 


Asn 


Leu 


Leu 


Val 


He 


446 










165 










170 










175 




449 


Leu 


Asn 


Asp 


Asn 


Asp 


Met 


Ser 


He 


Ser 


Pro 


Pro 


Val 


Gly 


Ala 


Met 


Asn 


450 








180 










185 










190 






453 


Asn 


Tyr 


Leu 


Thr 


Lys 


Val 


Leu 


Ser 


Ser 


Lys 


Phe 


Tyr 


Ser 


Ser 


Val 


Arg 


454 






195 










200 










205 








457 


Glu 


Glu 


Ser 


Lys 


Lys 


Ala 


Leu 


Ala 


Lys 


Met 


Pro 


Ser 


Val 


Trp 


Glu 


Leu 


458 




210 










215 










220 










461 


Ala 


Arg 


Lys 


Thr 


Glu 


Glu 


His 


Val 


Lys 


Gly 


Met 


lie 


Val 


Pro 


Gly 


Thr 


462 


225 










230 










235 










240 


465 


Leu 


Phe 


Glu 


Glu 


Leu 


Gly 


Phe 


Asn 


Tyr 


Phe 


Gly 


Pro 


He 


Asp 


Gly 


His 


466 










245 










250 








• 


255 




469 


Asp 


Val 


Glu 


Met 


Leu 


Val 


Ser 


Thr 


Leu 


Glu 


Asn 


Leu 


Lys 


Asp 


Leu 


Thr 


470 








260 










265 










270 






473 


Gly 


Pro 


Val 


Phe 


Leu 


His 


Val 


Val 


Thr 


Lys 


Lys 


Gly 


Lys 


Gly 


Tyr 


Ala 


474 






275 










280 










285 








477 


Pro 


Ala 


Glu 


Lys 


Asp 


Pro 


Leu 


Ala 


Tyr 


His 


Gly 


Val 


Pro 


Ala 


Phe 


Asp 


478 




290 










295 










300 










481 


Pro 


Thr 


Lys 


Asp 


Phe 


Leu 


Pro 


Lys 


Ala 


Ala 


Pro 


Ser 


Pro 


His 


Pro 


Thr 


482 


305 










310 










315 










320 


485 


Tyr 


Thr 


Glu 


Val 


Phe 


Gly 


Arg 


Trp 


Leu 


Cys 


Asp 


Met 


Ala 


Ala 


Gin 


Asp 


486 










325 










330 










335 




489 


Glu 


Arg 


Leu 


Leu 


Gly 


He 


Thr 


Pro 


Ala 


Met 


Arg 


Glu 


Gly 


Ser 


Gly 


Leu 


490 








340 










345 










350 






493 


Val 


Glu 


Phe 


Ser 


Gin 


Lys 


Phe 


Pro 


Asn 


Arg 


Tyr 


Phe 


Asp 


Val 


Ala 


He 


494 






355 










360 










365 








497 


Ala 


Glu 


Gin 


His 


Ala 


Val 


Thr 


Leu 


Ala 


Ala 


Gly 


Gin 


Ala 


Cys 


Gin 


Gly 
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4/1 



'till! 



<210> SEQ ID NO 41 
<211> LENGTH: 38 
<212> TYPE: DNA ^ 

<213> ORGANISM : /Artificial Sequence 
<220> FEATURE: 
<223> OTHER INFORMATTOKT 
<400> SEQUENCE: 41 



&4 





38 



agcagctagc ggaggaataa accatgagcg catttctc 



<210> SEQ ID NO 42 
<211> LENGTH: 26 
<212> TYPE: DNA 
<213> ORGANISM: 
<220> FEATURE 



C^tificial Sequ^ e-^ 

<223> OTHER INFORMATiqifT^T 
<400> SEQUENCE: 42 — 



26 



gactagtcac gacctgctcg aacgac 



j. otto* OkUWtsM #dJ h J) ^-tr^ 



if 



The type of errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 




• 
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PATENT APPLICATION: US/09/941,947 



DATE : 
TIME: 



09/18/2001 
10:40:17 



Input Set : A:\CL1903 US NA Seq Listing.txt 
Output Set: N:\CRF3\09182001\I941947.raw 



L:23 M:270 C: Current Application Number differs , Replaced Current Application No 

L:23 M-.271 C: Current Filing' Date differs; Replaced Current Filing Date 

L:3057 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:41 

L:3061 M:258 W: Mandatory Feature missing, <220> FEATURE: 

L:3061 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION: 

L:3071 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:42 

L:3075 M:258 W: Mandatory Feature missing, <220> FEATURE: 

L:3075 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION: 
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